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Claims 



What is claimed is: 

1 . A circuit, comprising: 

a flrs^ operation unit operably coupled to receive a first set of operands, wherein 
the first operation unit combines the first set of operands to produce a first operation 
result, wherein subsequent to receipt of the first set of operands, the first operation unit 
receives a second set of operands and combines the second set of operands to produce a 
second operation result; 

a pre-accumulati^n register operably coupled to the first operation unit, wherein 
the pre-accimiulation register stores the first operation result to produce a buffered first 
operation result; 

a second operation unit operably coupled to receive a third set of operands, 
wherein the second operation imi^ combines the third set of operands to produce a third 
operation result, wherein a first opei^and of the third set of operands is the second 
operation result; 

memory operably coupled to the second operation unit, wherein the memory 
stores the third operation result in one of a plurality of memory locations; and 

a selection block operably coupled to the memory, the second operation unit, the 
pre-accumulation register, and the first operation, unit, wherein the selection block selects 
a second operand of the third set of operands firom' a set of potential operands, wherein 
the set of potential operands includes the second operation result, the buffered first 
operation result stored in the pre-accxmiulation register, and data stored in at least one of 
the plurality of memory locations of the memory, whereip the selection block selects the 




second operand of the third set of operands based on operand selection information 
included in an operational code received by the circuit. 

2. The circuit of claim 1 , wherein the selection block is operably coupled to receive „ 
at least one additional operand, wherein the at least one additional operand is included in 
the set of potential operands. 

3. The circuit of claim 1 further comprises accumulation buffer operably coupled to 
the second operation unit and the selection block, wherein the accumulation buffer stores 
the third operation result to produce a buffered third operation result, wherein the 
buffered third operation result is included in the set of potential operands. 

4. The circuit of claim 1 , wherein the first operation unit is a multiplier. 



5. 



The circuit of claim 4, wherein the second operation unit is an adder. 
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6. A circuit, comprising: 

a first operation unit operably coupled to receive a first set of operands, wherein 
the first operation unit combines the first set of operands to produce a first operation 
5 result, wherein subsequent to receipt of the first set of operands, the first operation unit 
receives a second set of operands and combines the second set of operands to produce a 
second operation result; 



1 0 unit, wherein each pre-accumulation register of the plurality of pre-accumulation 
registers corresponds to a different thread of a plurality of threads, 

wherein when the first operation result produced by the first operation unit is a 
result of an operation for a selected thread of the plurality of threads, a pre-accumulation 
register that corresponds to the selected thread stores the first operation result to produce 

15 a buffered first operation result corresponding to the selected thread; 

a second operation unit operably coupled to receive a third set of operands, 
wherein the second operation unit combines the third set of operands to produce a third 
operation result, wherein a first operand of the third set of operands is the second 
20 operation result; 

memory operably coupled to the second operation unit, wherein the memory 
stores the third operation result in one of a plurality of memory locations; and 

25 a selection block operably coupled to the memory, the second operation unit, the 

plurality of pre-accumulation registers, and the first operation unit, wherein the selection 
block selects a second operand of the third set of operands fi-om a set of potential 
operands, wherein the set of potential operands includes the second operation result, the 
buffered first operation result stored in the plurality of pre-accumulation registers, 

30 additional buffered results stored in the plurality of pre-accumulation registers, and data 
stored in at least one of the plurality of memory locations of the memory, wherein the 



a plurality of pre-accumulation registers operably coupled to the first operation 
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selection block selects the second operand of the third set of operands based on operand 
selection information included in an operational code received by the circuit. 

7. The circuit of claim 6, wherein the selection block is operably coupled to receive 
at least one additional operand, wherein the at least one additional operand is included in 
the set of potential operands. 

8. The circuit of claim 6 further comprises an accumulation buffer operably coupled 
to the second operation unit and the selection block, wherein the accumulation buffer 
stores the third operation result to produce a buffered third operation result, wherein the 
buffered third operation result is included in the set of potential operands. 

9. The circuit of claim 6, wherein the first operation unit is a multiplier. 

10. The circuit of claim 9, wherein the second operation xmit is an adder. 

1 1 . The circuit of claim 6, wherein the first operation unit receives a fourth set of 
operands, wherein the first operation unit combines the fourth set of operands to produce 
a fourth operation result, wherein the first operation unit combines the fourth set of 
operands subsequent to combining the first set of operands and prior to combining the 
second set of operands. 
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12. A method for executing operation codes in a computation block, comprising: 
receiving a first set of operands during a first cycle; 

combining the first set of operands using a first operation imit during the first 
cycle to produce a first operation result; 

storing the first operation result in a pre-accumulation buffer dxiring a second 
cycle to produce a buffered first operation result; 

receiving a second set of operands during the second cycle; 

combining the second set of operands using the first operation unit during the 
second cycle to produce a second operation result; 

selecting a first operand of a third set of operands from a set of potential operands 
that includes the buffered first operation resuh, wherein selecting the first operand of the 
third set of operands occurs during a third cycle, wherein selecting the first operand of the 
third set of operands is based on a current operation code; and 

combining the second operation result and the buffered first operation result using 
a second operation unit during the third cycle to produce a third operation result. 

13. The method of claim 12 fiuther comprises storing the third operation result in a 
memory during a fourth cycle, wherein the set of potential operands includes previously 
determined operation results stored in the memory. 

14. The method of claim 12, wherein storing the first operation result fiirther 
comprises storing the first operation result in a pre-accimiulation buffer of a plurality of 
pre-accumulation buffers, wherein the pre-accumulation buffer is selected based on a 
thread of a plurality of threads to which the current operation code corresponds, wherein 




the set of potential operands includes previously buffered results stored in each of the 
plurality of pre-accumulation buffers. 



